Model Selection

Multi-scenario application

# Multi-scenario application

Voc2vec Hubert Ls Pt

voc2vec is a foundational model specifically designed for non-verbal human data, built on the HuBERT framework and pre-trained on 125 hours of non-verbal audio data.

Audio Classification

Transformers English

Blip Large Long Cap

A long-text image description generator fine-tuned based on BLIP, suitable for text-to-image prompts and image dataset annotation

An anime-style Stable Diffusion model fine-tuned from Anything V3, supporting high-quality image generation with danbooru tags

Image Generation English

Image Captioning Portuguese

This model converts images into Portuguese descriptions, trained on ViT and GPT2 architectures.

Image-to-Text Other

adalbertojunior

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase